© 



Europaisches Patentamt 
European Patent Office 
Office europeen des brevets 



© Publication number: 



0 440 028 A2 



© 



© Application number: 91100251.7 
© Date of filing: 09.01.91 



EUROPEAN PATENT APPLICATION 

© Int. OA H04Q 1/45 



2! 

00 
CM 



© Priority: 29.01.90 US 471500 
13.08.90 US 567233 

@ Date of publication of application: 
07.08.91 Bulletin 91/32 

© Designated Contracting States: 

DEFRGB ' 

© Applicant: DIALOGIC CORPORATION 
300 Littleton Road 

Parsippanny, New Jersey 07054(US) 



© Inventor: Hamilton, Chris 
41 Harrison Avenue 
Montclalr, New Jersey 07042(US) 

© Representative: Herrmann-Trentepohl, 
Werner, Dipl.-lng. et al 
Herrmann-Trentepohl, Kirschner, Grosse, 
Bockhorni & Partner Forstenrieder Allee 59 
W-8000 Munchen 71 (DE) 



© Multifrequency tone signal detector. 

© An MF digit or DTMF digit detector identifies MF 
digits or DTMF digits in a signal and eliminates 
talkoff as a source of false detection by analyzing 
the frequency spectrum of the signal which is pro- 



vided by FFT and by using the history of the signal 
to determine whether an MF digit or a DTMF digit 
was simulated by a human voice. 
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MULTIFREQUENCY TONE SIGNAL DETECTOR 



This is a continuation-in-part of patent applica- 
tion Serial No. 07/471,500 filed January 29, 1990. 

Technical field of the Invention 

The present invention pertains to the field of 
detecting and identifying telephone signals and, in 
particular, to method and apparatus for detecting 
and identifying multifrequency signals -including 
multifrequency tone (MF) signals and dual tone 
multifrequency signals (DTMF) signals-which oc- 
cur in a telephone network such as the Public 
Switched Telephone Network (PSTN). 

Background of the Invention 

It is well known to those of ordinary skill in the 
art that automated systems for use in telecommuni- 
cations applications typically require a capability of 
detecting and identifying signals which are trans- 
mitted over a telephone network, such as, for ex- 
ample, the Public Switched Telephone Network 
(PSTN) in order to function properly. For example, 
it is well known that, under certain conditions, spe- 
cific machine-generated, single-frequency signals, 
denoted as "call-progress" signals, are transmitted 
over a telephone network to indicate call status 
conditions. In addition to machine-generated, 
single-frequency signals, it is well known that auto- 
mated systems for use in telecommunications 
voice processing applications often require the ca- 
pability of detecting and identifying machine-gen- 
erated, communication signals such as ringback, 
dial tone, multifrequency signals such as, for exam- 
ple, multifrequency tone (MF) signals and dual tone 
multifrequency (DTMF) signals, and so forth, as 
well as signals which are produced by human 
speech, noise and hum, and so forth. Further, it is 
well known that the automated system advanta- 
geously should have the capability to perform such 
detection and identification on analog signals as 
well as on digital signals. 

An MF and DTMF detector is apparatus for 
detecting and identifying MF digits or DTMF digits 
in a digital or analog signal which is transmitted, for 
example, over the PSTN. As is well known to those 
of ordinary skill in the art, an MF digit or a DTMF 
digit typically is detected and identified by compar- 
ing the frequency characteristics of an input signal 
with the frequency characteristics of an MF digit or 
a DTMF digit, respectively. If the frequency char- 
acteristics of the input signal and an MF digit or 
DTMF digit, respectively, are close to each other, 
within a predetermined degree, the detector gen- 
erates a detection and identification signal which 



indicates the identity of the detected MF digit or 
DTMF digit. In practice, one well known method for 
determining the frequency characteristics of a digi- 
tal or analog signal is to perform a Fourier spectral 
5 analysis on a short time duration segment of the 
signal. Such a Fourier spectral analysis yields a set 
of numbers which represents the distribution of 
energy for the segment over the frequency spec- 
trum. Since an MF digit comprises one of fifteen 
10 (15) possible pairs of predetermined MF fundamen- 
tal tones and a DTMF digit comprises one of 
sixteen (16) possible pairs of predetermined DTMF 
fundamental tones, the frequency spectrum of the 
input signal may be compared with the frequency 
75 spectrum of fifteen (15) possible MF digits -fifteen 
(15) possible MF fundamental tone pairs- or six- 
teen (16) possible DTMF digits -sixteen (16) possi- 
ble DTMF fundamental tone pairs-, respectively. If 
the frequency spectrum of the input signal matches 
20 the frequency spectrum of one of the MF digits or 
DTMF digits, respectively, to a predetermined de- 
gree, the MF and DTMF detector generates a sig- 
nal which indicates: (a) that an MT digit or a DTMF 
digit, respectively, had been detected in the input 
25 signal and (b) the identity of the detected MT digit 
or DTMF digit respectively. 

While the above describes a detector which is 
sufficient for detecting and identifying MF digits 
and DTMF digits in many applications, there is a 
30 commonly occurring class of events which can fool 
such a detector. For example, talkoff is a com- 
monly occurring event which can cause false de- 
tection by such MT and DTMF detectors. Talkoff is 
an event wherein an MT digit or a DTMF digit is 
35 simulated by an input signal which is produced by 
a human voice. In particular, when a person 
speaks, the frequency spectrum of his or her voice 
will occasionally simulate the frequency spectrum 
of an MF digit or a DTMF digit and will, thereby, 
40 cause false detection by MF and DTMF detectors 
of the type described above. 

As one can readily appreciate from the above, 
there is a need in the art for method and apparatus 
for detecting and identifying MF digits and DTMF 
45 digits in analog signals as well as in digital signals 
which eliminates talkoff as a source of false detec- 
tion. In addition, there is a need for such a method 
and apparatus which can be easily customized for 
particular applications such as. for example, for low 
50 talkoff or for low signal-to-noise. 

Summary of the Invention 

Embodiments of the present invention advanta- 
geously solve the above-identified need in the art 
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by providing method and apparatus for detecting 
and identifying MF digits and DTMF digits in an 
input signal. If the input signal is an analog signal, 
In accordance with the present invention, samples 
are taken at a predetermined sampling rate and the 
samples are converted into a linear pulse code 
modulated (PCM) digital format. On the other hand, 
if the input signal is a digital signal, in accordance 
with the present invention, the digital signal values 
are transformed into a linear PCM digital format if 
the input signal is not already thusly encoded, i.e., 
if the input digital signal had been encoded using a 
u-law or an A-law PCM format, the digital signal 
values must be converted into a linear PCM format. 

Next, in accordance with the present invention, 
a Fourier spectral analysis is performed on short 
time duration segments of the input signal, which 
short time duration segments are referred to as 
frames. In particular, a frame is comprised of a 
predetermined number of samples of an input ana- 
log signal or a predetermined number of values of 
an input digital signal, all in linear PCM format 
Further, the Fourier spectral analysis of a frame is 
obtained by a Discrete Fourier Transform (DFT), 
which DFT provides the frequency spectrum of the 
frame. In particular, in preferred embodiments of 
the present invention, the DFT is a Fast Fourier 
Transform (FFT) which is performed by a Digital 
Signal Processor (DSP). 

As an example of the above, in a preferred 
embodiment of the present invention for use in 
analyzing an analog signal which is transmitted 
over the PSTN and which has a 4000 Hz band- 
width, the analog signal is sampled, in accordance 
with the Nyquist criterion, at least 8000 times/sec 
and the predetermined number of samples or val- 
ues per frame is chosen to be 256. Further, to 
increase temporal resolution, a frame of 256 sam- 
ples or values which is Fourier analyzed by the 
FFT is comprised as follows. The "present" frame 
comprises the last 128 samples or values from the 
previous frame and the next or "new" 128 samples 
or values which have been obtained from the input 
signal. As a result, the "next" frame to be Fourier 
analyzed by the FFT after the "present" frame 
comprises the 128 "new" samples or values from 
the "present" frame and the next 128 samples or 
values obtained from the input signal. Then, prior 
to calculating the FFT, each sample or value S n - 
(where n = 0, .... 255) is multiplied by a number 
w n which is given by w n = sin(2TIn/512). This 
windowing function is well known to those of or- 
dinary skill in the art and reduces anomalous spec- 
tral components due to the finite frame length of 
256 samples. 

Each frequency bin in the frequency spectrum 
has a bin index denoted by n. However, because 
the input signal is real, only the first 128 bins are of 



interest since the last 128 bins are identical to the 
first 128 bins. The real and imaginary coefficients 
determined by the FFT for each frequency bin in 
the frequency spectrum are squared and summed 

5 to provide a bin energy e(n) for each frequency bin 
in the frequency spectrum and, in addition, the bin 
energies for each bin are summed to provide the 
total energy ETOT for the frame. Next, a predeter- 
mined number of energy maxima in the frequency 

70 spectrum of the frame are determined. An energy 
maximum is defined as the occurrence of a bin in 
the frequency spectrum of a frame which has more 
energy than its adjacent sidebins and, in accor- 
dance with a preferred embodiment of the present 

16 invention, the only energy maxima determined are 
the three largest in the spectrum. Next, the follow- 
ing tests are performed. 

The first test determines whether the total en- 
ergy of the frame, ETOT, falls below a predeter- 

20 mined threshold for MF and a predetermined 
threshold for DTMF, respectively. If the total en- 
ergy falls below a particular predetermined thresh- 
old then the input signal cannot be that particular 
type of digit 

25 The second test compares the frequencies of 

the fundamental energy of the two largest maxima 
with the frequencies of each of the fifteen (15) 
possible MF digits -for MF- or with the frequen- 
cies of each of the sixteen (16) possible DTMF 

30 digits -for DTMF. Each MF digit is comprised of a 
pair of MF fundamental signals -each of which MF 
fundamental signals has an energy substantially at 
a predetermined frequency- and each DTMF digit 
is comprised of a pair of DTMF fundamental sig- 

35 nals -each of which DTMF fundamental signals 
has an energy substantially at a predetermined 
frequency. In particular, for MF digits, there are six 
(6) MF fundamental signals and, for DTMF digits, 
there are eight (8) DTMF fundamental signals (four 

40 lowband signals and four highband signals), and a 
DTMF digit is comprised of one of the four possi- 
ble DTMF fundamental lowband signals and one of 
the four possible DTMF highband signals. Thus, if 
an MF digit or a DTMF digit is present, most of the 

45 energy in the frame will reside in bins correspond- 
ing to the frequency of the MF fundamental signals 
of the MF digit or in bins corresponding to the 
frequency of the DTMF fundamental signals of the 
DTMF digit, respectively. As should be understood 

so by those of ordinary skill in the art, because an MF 
fundamental signal or a DTMF fundamental signal 
is often not centered on a particular frequency bin 
in the frequency spectrum, some energy from an 
MF fundamental or a DTMF fundamental will be 

55 distributed to adjacent bins which are denoted as 
sidebins. In accordance with the present invention, 
the fundamental energy is defined as the sum of 
the energy in a peak of the frequency spectrum 
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and the energy of the peak's largest sidebin. Of 
course, it should be clear to those of ordinary skill 
in the art that these two bins could have equal 
energy. If a match is found with two fundamental 
frequencies of an MF digit or a DTMF digit, respec- 5 
tiveiy, we have a possible MF digit or DTMF digit, 
respectively. This test permits the frequencies of 
the two largest maxima to deviate from the MF 
fundamental frequencies or DTMF fundamental fre- 
quencies, respectively, to which they are closest to 
by more than a predetermined amount. For exam- 
ple, according to well known, standard telephone 
transmission criteria, one can have a valid MF digit 
when there is a 5 Hz frequency deviation in the MF 
fundamental tones and one can have a valid DTMF 15 
digit where there is a 1.5 % frequency deviation in 
the DTMF fundamental tones. However, for a 
DTMF digit, a deviation of over 3.5 % is cause for 
rejection for DTMF digits. 

The third test determines whether the total 20 
energy for this frame is less than the total energy 
for the previous frame and, if the total energy is 
decreasing, the MF digit or DTMF digit, respec- 
tively, is rejected. The MF digit or DTMF digit is 
rejected in accordance with the present invention 25 
because it has been determined that it is highly 
unlikely that the inception of an MF digit or a 
DTMF digit would have less energy than occurs in 
the previous frame. 

The fourth test determines whether the ratio of 30 
the total energy to the non-MF energy or non- 
DTMF energy, respectively, in the frame is less 
than a predetermined constant BROADMF or 
BROADDTMF, respectively, and, if it is not, we still 
have a possible valid MF digit or DTMF digit, 35 
respectively. The non-MF energy or non-DTMF en- 
ergy is determined to be equal to the difference 
between ETOT and the MF digit energy or the 
DTMF digit energy, respectively, where the MF 
digit energy or the DTMF digit energy equals the 40 
sum of the MF fundamentals or the DTMF fun- 
damentals, respectively. Thus, if the ratio of the 
total energy to non-MF energy is too small, we do 
not have a valid MF digit or if the ratio of the total 
energy to non-DTMF energy is too small, we do 45 
not have a valid DTMF digit. 

The fifth test determines the signal-to-noise 
ratio for this frame. The ratio of the MF digit energy 
or the DTMF digit energy and the energy of the 
third maximum is compared with SNRMF or so 
SNRDTMF, respectively, where SNRMF and 
SNRDTMF are predetermined constants. If the ratio 
of the MF digit energy or the DTMF digit energy to 
the energy of the third maximum, i.e., the signal-to- 
noise ratio, is greater than SNRMF or SNRDTMF, 55 
respectively, then we still have a possible valid MF 
digit or DTMF digit, respectively. 

The sixth test involves twist, i.e., the ratio of the 



fundamental energy of the larger energy MF fun- 
damental of the MF digit to the fundamental energy 
of the smaller energy MF fundamental -the fun- 
damental energy is the sum of the maximum en- 
ergy and the energy of the largest sidebin— or the 
ratio of the fundamental energy of the larger en- 
ergy DTMF fundamental of the DTMF digit to the 
fundamental energy of the smaller energy DTMF 
fundamental. If the MF ratio is less than TWISTMF 
or the DTMF ratio is less than TWISTDTMF, where 
TWISTMF and TWISTDTMF are predetermined 
constants, we still have a possible valid MF digit or 
DTMF digit, respectively. 

The seventh test determines taikoff. In order to 
do so, the following quantities are determined. ED 
and ES are the difference and sum, respectively, of 
ETOT for the present frame and for the second 
previous frame, respectively. If the ratio of ES to 
ED is larger than EDEVMF or EDEVDTMF, respec- 
tively, where EDEVMF and EDEVDTMF are pre- 
determined constants, then a decision is made that 
a human voice simulated an MF digit or a DTMF 
digit, respectively, i.e., taikoff has occurred. It is 
important to note that embodiments of the present 
invention provide excellent determination of taikoff 
because the embodiments take into account the 
history of the input signal. 

Brief Description of the Drawing 

A complete understanding of the present inven- 
tion may be gained by considering the following 
detailed description in conjunction with the accom- 
panying drawing, in which: 

FIG. 1 shows, in pictorial form, a block diagram 
of a preferred embodiment of the inventive ap- 
paratus for detecting and identifying MF digits 
or DTMF digits in telephone signals; and 
FIG. 2 shows a flow chart of a microprocessor 
program which forms part of the preferred em- 
bodiment of the inventive apparatus shown in 
FIG. 1. 

To facilitate understanding, identical reference 
numerals have been used to denote identical ele- 
ments common to the figures. 

Detailed Description 

FIG. 1 shows a block diagram of a preferred 
embodiment of inventive apparatus DM 10 and the 
manner in which it is used for detecting and iden- 
tifying MF and DTMF digits in telephone signals. 
As shown in FIG. 1, analog telephone signal 100 
from telephone network 20 is transmitted by tele- 
phone network interface 25 to DM 10 as signal 110. 
Many apparatus for use as telephone interface 25 
are well known to those of ordinary skill in the art 
For example, one such apparatus comprises a por- 
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tion of a DIALOG/41 Digitized Voice and Telephony 
Computer Interface circuit which is available from 
Dialogic Corporation, 300 Littleton Road, Parsip- 
pany, New Jersey 07054. In pertinent part, this 
circuit comprises well known means for interfacing 
with the telephone network to send and receive 
calls; means, such as transformers, to electrically 
isolate subsequent circuits; and filter circuits. 

Signal 110 which is output from telephone net- 
work interface 25 is applied as input to DM 10 and, 
in particular, to ancillary hardware 70. Specifically, 
signal 110 is applied to a sample and hold circuit 
(not shown) in ancillary hardware 70, which sample 
and hold circuits are well known to those of or- 
dinary skill in the art. 

The output from the sample and hold circuit 
contained in ancillary hardware 70 is applied to 
linear PCM analog-to-digital converter 40. There 
are many circuits which are well known to those of 
ordinary skill in the art that can be .used as linear 
PCM analog-to-digital converter 40. The encoded 
signal output from analog-to-digital converter 40 is 
placed, sample by sample, into a tri-state buffer 
(not shown) for subsequent transmittal to a data 
bus (not shown). A tri-state buffer for performing 
this function is well known to those of ordinary skill 
in the art. For example, the tri-state buffer may be 
a Tl 74LS244 tri-state buffer which is available from 
Texas Instruments of Dallas, Texas, or any other 
such equipment. 

DM 10 further comprises microprocessor 50, 
memory 60, Digital Signal Processor (DSP) 65, and 
a portion of ancillary hardware 70 for use in inter- 
facing with a host computer 30. DSP 65 may be 
any one of a number of digital signal processors 
which are well known to those of ordinary skill in 
the art such as, for example, a Motorola 56000 
processor and microprocessor 50 may be any one 
of a number of microprocessors which are well 
known to those of ordinary skill in the art such as 
an INTEL 286 microprocessor which is available 
from INTEL of Santa Clara, California, or any other 
such equipment. Memory 60 may be any one of a 
number of memory equipments which are well 
known to those of ordinary skill in the art such as 
an HITACHI 6264 RAM memory which is available 
from HITACHI America Ltd. of San Jose, California, 
or any other such equipment. The portion of an- 
cillary hardware 70 which interfaces with host com- 
puter 30 may be readily fabricated by those of 
ordinary skill in the art by using circuits which are 
also well known to those of ordinary skill in the art. 
For example, the portion of ancillary hardware 70 
which interfaces with host computer 30 may be 
comprised of Tl 74LS245 data bus transceivers, Tl 
74LS244 address buffers, and Tl PAL 16L8 control 
logic, all of which is available from Texas Instru- 
ments of DaJlas, Texas, or any other such equip- 



ment. Finally, as shown in FIG. 1, DM 10 interfaces 
with host computer 30, which may be any one of a 
number of computers which are well known to 
those of ordinary skill in the art such as, for exam- 
s pie, an IBM PC/XT/AT, or any other such equip- 
ment. 

The encoded digital samples output from linear 
PCM analog-to-digital encoder 40 are placed in the 
buffer (not shown) and are output, in turn, there- 
jo from to the data bus (not shown). Then, the digital 
samples are received from the data bus, digital 
sample by digital sample, by microprocessor 50. 
Microprocessor 50, in accordance with the present 
invention and as will be described in detail below, 

is places a predetermined number of digital samples 
on the data bus for receipt and analysis by DSP 
65. The output from DSP 65 is placed on the data 
bus for transmittance to microprocessor 50. Then, 
as will be described in detail below, microproces- 

20 sor 50, in conjunction with a program and data 
stored in memory 60, analyzes the DSP output to 
detect and identify an MF digit or a DTMF digit in 
the input signal and, in response thereto, to gen- 
erate and transmit a signal to host computer 30. As 

25 is well known to those of ordinary skill in the art, 
host computer may be a part of an interactive 
system whose operation entails receiving informa- 
tion from a human by means of DTMF digits. As 
such, the interactive system of which host com- 

30 puter 30 is a part utilizes the signal provided by 
DM 10 to generate further signals to obtain further 
information from the human or it performs oper- 
ations and then transmits further information to the 
human. Such systems are well known in the art 

35 and, for simplicity, their detailed operation need not 
be set forth here. 

As stated above, apparatus DM 10 detects and 
identifies MF digits or DTMF digits in telephone 
signal 100. In so doing, DM 10 determines the 

40 energy content of predetermined portions of the 
frequency spectrum of incoming telephone signal 
100. In addition, if input telephone signal 100 is not 
an analog signal, as is the case for the embodi- 
ment shown in FIG. 1. but is instead a digital 

45 signal, embodiments of the present invention con- 
vert the digital values of the input signal into a 
linear PCM digital format For example, if the input 
digital signal values had been encoded using u-law 
or A-law PCM, they are converted into a linear 

so PCM format. This conversion is performed in ac- 
cordance with methods and apparatus which are 
well known to those of ordinary skill in the art such 
as, for example, by using a lookup table stored in 
memory 60. Nevertheless, in describing the inven- 

55 tive method and apparatus, for ease of understand- 
ing, we will refer to the linear PCM digital format 
samples which are output from analog-to-digital 
encoder 40 as digital samples. 
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The digital samples are input into DSP 65 
where a Fast Fourier Transform (FFT) is performed 
to produce the frequency spectrum of input signal 
100. Then, microprocessor 50 analyzes the fre- 
quency spectrum output from DSP 65 to detect 
and identify MF digits or DTMF digits under the 
direction of a software program which performs in 
accordance with the flow chart shown in FIG. 2. 

Before discussing how DM 10 detects and 
identifies MF digits or DTMF digits in accordance 
with preferred embodiments of the present inven- 
tion, we will first describe how DM 10 operates in 
general to detect and identify MF digits or DTMF 
digits. 

In accordance with the present invention, a 
Fourier spectral analysis is performed on short time 
duration segments of the input signal, which short 
time duration segments are referred to as frames. 
In particular, a frame is comprised of a predeter- 
mined number of samples of an input analog signal 
or a predetermined number of values of a input 
digital signal, all in linear PCM format Further, the 
Fourier spectral analysis of a frame is obtained by 
a Discrete Fourier Transform (OFT), which OFT 
provides the frequency spectrum of the frame. In 
particular, in preferred embodiments of the present 
invention, the DFT is a Fast Fourier Transform 
(FFT) which is performed by DSP 65. 

In the preferred embodiment of the present 
invention shown in FIG. 1 for use in analyzing 
analog signal 100 which is transmitted over the 
PSTN and which has a 4000 Hz bandwidth, analog 
signal 100 is sampled, in accordance with the 
Nyquist criterion, at least 8000 times/sec and the 
predetermined number of samples or values per 
frame is chosen to be 256. Further, in order to 
increase temporal resolution, a frame of 256 values 
which is input to DSP 65 for Fourier analysis is 
comprised as follows. The "present" frame com- 
prises the last 128 values from the previous frame 
and the next or "new" 128 values which have been 
obtained from input signal 100. As a result, the 
"next" frame to be Fourier analyzed by the FFT 
after the "present" frame comprises the 128 "new" 
values from the "present" frame and the next 128 
values obtained from input signal 100. Then, prior 
to calculating the FFT, each sample or value S n - 
(where n = 0, 255) is multiplied by a number 
w n which is given by w n = sin(2TIn/512). This 
windowing function is well known to those of or- 
dinary skill in the art and reduces anomalous spec- 
tral components due to the finite frame length of 
256 samples. 

As a result of the above, when DSP 65 of FIG. 
1 is embodied in a Motorola 56000 DSP and 256 
samples are used to perform a Fast Fourier Trans- 
form (FFT), a 128 bin frequency spectrum for the 
input signal is produced wherein the frequency 



bins are 31.25 Hz wide. Each frequency bin in the 
frequency spectrum has a bin index denoted by n. 
However, because the signal is real, only the first 
128 bins are of interest since the last 128 bins are 
6 identical to the first 128 bins. The real and imagi- 
nary coefficients determined by the FFT for each 
frequency bin are squared and summed to provide 
a bin energy e(n) for each frequency bin in the 
frequency spectrum and, in addition, the energies 
w for each bin are summed to provide the total en- 
ergy ETOT for the frame. 

Next, a predetermined number of energy maxi- 
ma in the frequency spectrum of the frame are 
determined. An energy maximum is defined as the 
75 occurrence of a bin in the frequency spectrum of a 
frame which has more energy than its adjacent 
sidebins and, in accordance with a preferred em- 
bodiment of the present invention, the only energy 
maxima determined are the three largest in the 
20 spectrum. Next, the following tests are performed. 

The first test determines whether the total en- 
ergy of the frame, ETOT, falls below a predeter- 
mined threshold for MF and a predetermined 
threshold for DTMF, respectively. If the total en- 
25 ergy falls below a particular predetermined thresh- 
old then the input signal cannot be that particular 
type of digit. 

The second test compares the frequencies of 
the fundamental energy of the two largest maxima 
30 with the frequencies of each of the fifteen (15) 
possible MF digits for MF digit detection or with 
the frequencies of each of the sixteen (16) possible 
DTMF digits for DTMF digit detection. Each MF 
digit is comprised of a pair of MF fundamental 
35 signals —each of which MF fundamental signals 
has an energy substantially at a predetermined 
frequency- and each DTMF digit is comprised of a 
pair of DTMF fundamental signals -each of which 
DTMF fundamental signals has an energy substan- 
40 tiaily at a predetermined frequency. In particular, 
for MF digits, there are six (6) MF fundamental 
signals and, for DTMF digits, there are eight (8) 
DTMF fundamental signals (four lowband signals 
and four highband signals), and a DTMF digit is 
45 comprised of one of the four possible DTMF fun- 
damental lowband signals and one of the four pos- 
sible DTMF highband signals. Thus, if an MF digit 
or a DTMF digit is present, most of the energy in 
the frame will reside in bins corresponding to the 
so frequency of the MF fundamental signals of the MF 
digit or in bins corresponding to the frequency of 
the DTMF fundamental signals of the DTMF digit, 
respectively. As should be understood by those of 
ordinary skill in the art, because an MF fundamen- 
55 tal signal or a DTMF fundamental signal is often 
not centered on a particular frequency bin in the 
frequency spectrum, some energy from an MF 
fundamental or a DTMF fundamental will be distrib- 
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uted to adjacent bins which are denoted as 
sidebins. In accordance with the present invention, 
the fundamental energy is defined as the sum of 
the energy in a peak of the frequency spectrum 
and the energy of peak's largest sidebin. Of 
course, it should be clear to those of ordinary skill 
in the art that these two bins could have equal 
energy. If a match is found with two fundamental 
frequencies of an MF digit or DTMF digit, respec- 
tively, we have a possible MF digit or DTMF digit, 
respectively. This test permits the frequencies of 
the two largest maxima to deviate from the MF 
fundamental frequencies or DTMF fundamental fre- 
quencies, respectively, to which they are closest 
by more than a predetermined amount. For exam- 
ple, according to well known, standard telephone 
transmission criteria, one can have a valid MF digit 
when there is a 5 Hz frequency deviation in the MF 
fundamental tones and one can have a valid DTMF 
digit where there is a 1 .5 % frequency deviation in 
the DTMF fundamental tones. However, for a 
DTMF digit, a deviation of over 3.5 % is cause for 
rejection. 

The third test determines whether the total 
energy for this frame is less than the total energy 
for the previous frame and, If the total energy is 
decreasing, the MF digit or DTMF digit, respec- 
tively, is rejected. The MF digit or DTMF digit is 
rejected in accordance with the present invention 
because it has been determined that it is highly 
unlikely that the inception of an MF digit or a 
DTMF digit would have less energy than occurs in 
the previous frame. 

The fourth test determines whether the ratio of 
the total energy to the non-MF energy or non- 
DTMF energy, respectively, in the frame is less 
than a predetermined constant BROADMF or 
BROADDTMF, respectively, and, if it is not, we still 
have a possible valid MF digit or DTMF digit, 
respectively. The non-MF energy or non-DTMF en- 
ergy is determined to be equal to the difference 
between ETOT and the MF digit energy or the 
DTMF digit energy, respectively, where the MF 
digit energy or the DTMF digit energy equals the 
sum of the MF fundamentals or the DTMF fun- 
damentals, respectively. Thus, if the ratio of the 
total energy to non-MF energy is too small, we do 
not have a valid MF digit or if the ratio of the total 
energy to non-DTMF energy is too small, we do 
not have a valid DTMF digit. 

The fifth test determines the signal-to-noise 
ratio for this frame. The ratio of the MF digit energy 
or the DTMF digit energy and the energy of the 
third maximum is compared with SNRMF or 
SNRDTMF, respectively, where SNRMF and 
SNRDTMF are predetermined constants. If the ratio 
of the MF digit energy or the DTMF digit energy to 
the energy of the third maximum, i.e., the signal-to- 



noise ratio, is greater than SNRMF or SNRDTMF, 
respectively, then we still have a possible valid MF 
digit or DTMF digit, respectively. 

The sixth test involves twist, i.e., the ratio of the 

s fundamental energy of the larger energy MF fun- 
damental of the MF digit to the fundamental energy 
of the smaller energy MF fundamental -the fun- 
damental energy is the sum of the maximum en- 
ergy and the energy of the largest sidebin™ or the 

10 ratio of the fundamental energy of the larger en- 
ergy DTMF fundamental of the DTMF digit to the 
fundamental energy of the smaller energy DTMF 
fundamental. If the MF ratio is less than TWISTMF 
or the DTMF ratio is iess than TWISTDTMF, where 

75 TWISTMF and TWISTDTMF are predetermined 
constants, we still have a possible valid MF digit or 
DTMF digit, respectively. 

The seventh test determines taikoff. In order to 
do so, the following quantities are determined. ED 

20 and ES are the difference and sum, respectively, of 
ETOT for the present frame and for the second 
previous frame, respectively. If the ratio of ES to 
ED is larger than EDEVMF or EDEVDTMF, respec- 
tively, where EDEVMF and EDEVDTMF are pre- 

25 determined constants, then a decision is made that 
a human voice simulated an MF digit or a DTMF 
digit, respectively, i.e., taikoff has occurred. It is 
important to note that embodiments of the present 
invention provide excellent determination of taikoff 

30 because the embodiments take into account the 
history of the input signal. 

If at least two frames having a "valid" MF digit 
or "valid" DTMF digit have been detected, a deter- 
mination that the the MF digit or DTMF digit, re- 

35 spectively, has ended is made only if there is a 
drop in energy in a frame by more than one-half as 
compared with the previous frame. Lastly, if only 
one frame of a valid digit has been detected, i.e., 
the "first" frame, a determination that the MF digit 

40 or DTMF digit has ended is made: (a) when the 
next frame still has a "valid" MF digit or a "valid" 
DTMF digit, respectively, if there is a drop in 
energy in the next frame by more than one-half as 
compared with the "first" frame or (b) when the 

45 next frame has a different "valid" MF digit or 
DTMF digit or no "valid" MF digit or DTMF digit, if 
there is any energy drop. 

With the general description set forth above in 
mind, we will now describe the preferred embodi- 

50 ment of the present invention in connection with 
FIG. 2. FIG. 2 shows a flow chart of a microproces- 
sor program which forms part of the preferred 
embodiment of the inventive apparatus shown in 
FIG. 1 for detecting MF digits or DTMF digits. The 

55 following description will cover the detection of 
DTMF digits and will point out where the procedure 
is changed for the detection of MT digits. When 
MF digit or DTMF digit detector DM 10 analyzes 
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analog telephone signal 100, samples of the am- 
plitude of signal 100 are obtained in time se- 
quence. Each sample is converted to a linear PCM 
digital format by means, for example, of a compari- 
son with data stored in lookup tables in memory 60 
or in a companion ROM (not shown) and the linear 
sample data is stored in memory 60. Samples are 
accumulated and loaded, 256 samples at a time in 
the preferred embodiment, into DSP 65. DSP 65 
multiplies each sample S n by sin(2||n/512) where, in 
the preferred embodiment, the values of the half 
sine function have been previously stored in mem- 
ory. Next, DSP 65 performs a 256 point Fast 
Fourier Transform (FFT) on this data. 

Next, the real and imaginary coefficients for 
each frequency bin are squared and summed to 
provide an energy for each frequency bin. In the 
following, frequency bin indices are denoted by n 
and the energy for each such frequency bin is 
denoted by e(n). However, because the signal is 
real, only the first 128 bins are of interest. 

Next, a predetermined number of energy maxi- 
ma in the frequency spectrum of the frame are 
determined. An energy maximum is defined as the 
occurrence of a bin in the frequency spectrum of a 
frame which has more energy than its adjacent 
sidebins. Further, in accordance with a preferred 
embodiment of the present invention, the energy 
maxima determined are the three largest. Next, the 
energy in each bin in the frequency spectrum of . 
the frame is summed to provide the the total en- 
ergy of the frame ETOT. 

Because of possibly imperfect duplexing in 
telephone network interface 25, e(n) in the fre- 
quency spectrum of input signal 100 which cor- 
respond to frequencies below 600 Hz are all set to 
zero. 

Referring now to FIG. 2, box 300 is the point in 
the program wherein: (a) 128 digital values are 
obtained from input signal 100 -as described 
above if signal 100 is already in digital form a 
transformation into linear PCM digital format is per- 
formed if necessary; (b) a Discrete Fourier Trans- 
form is performed on the previous 128 values ob- 
tained and the present 128 values obtained -in this 
embodiment an FFT is performed by DSP 65- to 
provide the frequency spectrum of this group of 
values which is referred to as a frame; (c) the 
energy of each bin in the frequency spectrum of 
the frame is determined as the sum of the square 
of the real and imaginary frequency component for 
each bin; (d) the total energy of the frame, ETOT, 
is determined as the sum of the energy of all the 
frequency bins; and (e) the energy and frequency 
bin index of the three largest energy peaks in the 
frequency spectrum of the frame are determined, 
along with the energy and frequency bin index of 
the largest energy sidebins for the three largest 



energy peaks. [For MF digit detection only, param- 
eter endcnt is incremented by 1.] Then, the pro- 
gram in microprocessor 50 transfers control to de- 
cision box 310. 

s Decision box 310 is the point in the program at 

which microprocessor 50 examines the total energy 
ETOT in the present frame to determine whether 
the total energy is above or below a predetermined 
threshold, ERG. [ERG = ERGMF or ERGDTMF for 

w MF or DTMF digit detection, respectively.] If the 
total energy is below the predetermined threshold, 
the program in microprocessor 50 transfers control 
to box 320. However, if the total energy is equal to 
or above the predetermined energy level, the pro- 

75 gram transfers control to box 360. 

Box 320 is the point in the program at which 
microprocessor 50 initializes flag digflg by setting ft 
to 0. Rag digflg indicates whether input signal 100 
is in the middle of a verified or suspected MF or 

20 DTMF digit, respectively. Specifically, when digflg 
= 0, microprocessor 50 is awaiting the first frame 
which is suspected of being produced by an MF or 
a DTMF digit, respectively. Then, the program in 
microprocessor 50 transfers control to decision box 

25 330. 

Decision box 330 is the point in the program at 
which microprocessor 50 determines whether 
nframe is equal to 0 or 1 or if nframe is greater 
than 1. Flag nframe indicates the number of con- 

30 secutive frames in which an MF digit or a DTMF 
digit, respectively, is found. H nframe « 1, the 
program transfers control to box 340; if nframe = 
0, the program transfers control to box 300; and, if 
nframe is > 1 , the program transfers control to box 

35 350. 

Box 340 is the point in the program at which 
microprocessor 50 has determined that the pre- 
vious frame was suspected of having an MF digit 
or a DTMF digit present, respectively, and that the 

40 MF digit or DTMF digit, respectively, has ended in 
this frame. Then, DM 10: (a) sends a signal which 
identifies the MF digit or the DTMF digit, respec- 
tively, to host computer 30 and (b) sends a signal 
that the MF digit or DTMF digit, respectively, has 

45 ended to host computer 30. Finally, the program 
sets nframe = 0 and, if detecting an MF digit, it 
sets endcnt = 0. Then, the program transfers 
control to box 300. 

Box 350 is the point in the program at which 

so microprocessor 50 has determined that the MF 
digit or DTMF digit, respectively, which had been 
detected and identified previously has ended in this 
frame. Then, DM 10 sends a signal that the MF 
digit or DTMF digit, respectively, has ended to host 

55 computer 30. Finally, the program sets nframe = 0 
and, if detecting an MF digit, it sets endcnt = 0. 
Then, the program transfers control to box 300. 
Box 360 is the point in the program at which 
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microprocessor 50 goes through the three largest 
energy peaks and determines whether they cor- 
respond to one of the six (6) MF fundamentals or 
to one of the eight (8) DTMF fundamentals, respec- 
tively. The number of the MF fundamental (1-6) or 
DTMF fundamental (1-8), respectively, and the fre- 
quency bin index of the peak corresponding there- 
to is stored in order of increasing frequency. In 
practice one must permit the frequency of the input 
tones to deviate by predetermined amounts from 
each of the MF fundamentals or DTMF fundamen- 
tals, respectively. In order to determine the accept- 
able bounds for each fundamental, one takes the 
frequency of each fundamental and adds and sub- 
tracts a predetermined amount to provide frequen- 
cies of the upper and lower bounds, respectively, 
for each fundamental. Then, the FFT for the upper 
and lower frequency bounds are performed to pro- 
vide the maximum energy bin, largest sidebin, and 
the ratio of maximum energy to largest sidebin 
energy for the upper and lower bounds for each 
fundamental. These values are used to compare 
with the input maxima to determine whether a peak 
falls within the bounds of one of the MF fundamen- 
tals or DTMF fundamentals, respectively. 

In particular, in the preferred embodiment, the 
determination entails the following tests for MF 
digits. 

There is a match for MT fundamental 1 if: 

a) the frequency bin index of the maximum 
energy = 22 and the frequency bin index of the 
largest sidebin is 23 

or 

the frequency bin index of the largest sidebin - 
21 and energy of maximum bin > dn*(energy of 
sidebin 21) 

b) the frequency bin index of the maximum 
energy = 23 and the frequency bin index of the 
largest sidebin = 22 and energy of maximum 
bin < di2*(energy of sidebin 22) 

There is a match for MT fundamental 2 if: 

a) the frequency bin index of the maximum 
energy = 28 and energy of maximum bin < 
d2i*(energy of sidebin 29) 

b) the frequency bin index of the maximum 
ener gy = 29 and the frequency bin index of the 
largest sidebin = 28 

or 

the frequency bin index of the largest sidebin = 
30. and energy of maximum bin > d22*(energy of 
sidebin 30) 
There is a match for MT fundamental 3 if: 

a) the frequency bin index of the maximum 
energy « 34 and the frequency bin index of the 
largest sidebin =* 35 and energy of maximum 
bin > d3i "(energy of sidebin 35) 

b) the frequency bin index of the maximum 
energy = 35 



c) the frequency bin index of the maximum 
energy = 36 and the frequency bin index of the 
largest sidebin = 35 and energy of maximum 
bin < ds2*(energy of sidebin 35) 
6 There is a match for MF fundamental 4 if: 

a) the frequency bin index of the maximum 
energy 85 41 and the frequency bin index of the 
largest sidebin « 42 

or 

w the frequency bin index of the largest sidebin = 
40 and energy of maximum bin > d4i '(energy of 
sidebin 40) 

b) the frequency bin index of the maximum 
energy = 42 

75 c) the frequency bin index of the maximum 
energy = 43 and the frequency bin index of the 
largest sidebin = 42 and energy of maximum 
bin < d42*(energy of sidebin 42) 
There is a match for MF fundamental 5 if. 

20 a) the frequency bin index of the maximum 
energy = 47 and the frequency bin index of the 
largest sidebin = 48 and energy of maximum 
bin < dsT(energy of sidebin 48) 

b) the frequency bin index of the maximum 
25 energy = 48 

c) the frequency bin index of the maximum 
energy = 49 and the frequency bin index of the 
largest sidebin = 48 and energy of maximum 
bin > ds2*(energy of sidebin 48) 

30 There is a match for MF fundamental 6 if: 

a) the frequency bin index of the maximum 
energy = 53 and the frequency bin index of the 
largest sidebin = 54 and energy of maximum 
bin < d6i*(energy of sidebin 54) 

35 b) the frequency bin index of the maximum 
energy = 54 

c) the frequency bin index of the maximum 
energy = 55 and the frequency bin index of the 
largest sidebin = 54 
40 or 

the frequency bin index of the largest sidebin = 
56 and energy of maximum bin > de2*(energy of 
sidebin 56) du, di2; d2i, d22; dat, da2; cUi. d«; 
dsi. ds2; and 

45 dsi, dc2 are predetermined constants which are 
stored In memory 60. In other embodiments of the 
present invention, these constants may be down- 
loaded into memory 60 from, for example, host 
computer 30. 

so In particular, in the preferred embodiment, the 

determination entails the following tests for DTMF 
digits. 

There is a match for DTMF fundamental 1 if: 

a) the frequency bin index of the maximum 
55 energy = 22 and the frequency bin index of the 
largest sidebin is 23 
or 

the frequency bin index of the largest sidebin = 
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21 and energy of maximum bin > CiT(energy of 
sidebin 21) 

b) the frequency bin index of the maximum 
energy = 23 and the frequency bin index of the 
largest sidebin = 22 and energy of maximum 
bin < ci2*(energy of sidebin 22) 
There is a match for DTMF fundamental 2 if: 

a) the frequency bin index of the maximum 
energy = 24 and the frequency bin index of the 
largest sidebin = 25 and energy of maximum 
bin < C2i*(energy of sidebin 25) 

b) the frequency bin index of the maximum 
energy = 25 and the frequency bin index of the 
largest sidebin = 24 

or 

the frequency bin index of the largest sidebin = 
26 and energy of maximum bin > C22*(energy of 
sidebin 26) 

There is a match for DTMF fundamental 3 if: 

a) the frequency bin index of the maximum 
energy = 27 and the frequency bin index of the 
largest sidebin = 28 

or 

the frequency bin index of the largest sidebin = 
26 and energy of maximum bin > C31 "(energy of 
sidebin 26) 

b) the frequency bin index of the maximum 
energy = 28 and the frequency bin index of the 
largest sidebin = 27 and energy of maximum 
bin < Cs2*(energy of sidebin 27) 

There is a match for DTMF fundamental 4 if: 

a) the frequency bin index of the maximum 
energy = 30 and the frequency bin index of the 
largest sidebin - 31 

or 

the frequency bin index of the largest sidebin = 
29 and energy of maximum bin > C41 "(energy of 
sidebin 30) 

b) the frequency bin index of the maximum 
energy = 31 and the frequency bin index of the 
largest sidebin = 30 and energy of maximum 
bin < C42*(energy of sidebin 30) 

There is a match for DTMF fundamental 5 if: 

a) the frequency bin index of the maximum 
energy = 38 and the frequency bin index of the 
largest sidebin = 39 and energy of maximum 
bin < cs /(energy of sidebin 39) 

b) the frequency bin index of the maximum 
energy = 39 and the frequency bin index of the 
largest sidebin = 38 the frequency bin index of 
the largest sidebin = 40 and energy of maxi- 
mum bin > Cs2*(energy of sidebin 40) 

There is a match for DTMF fundamental 6 if: 

a) the frequency bin index of the maximum 
energy = 42 and the frequency bin index of the 
largest sidebin = 43 and energy of maximum 
bin < C6i*(energy of sidebin 43) 

b) the frequency bin index of the maximum 



energy = 43 and the frequency bin index of the 

largest sidebin = 42 

or 

the frequency bin index of the largest sidebin = 
s 44 and energy of maximum bin > C62*(energy of 
sidebin 44) 

There is a match for DTMF fundamental 7 if: 

a) the frequency bin index of the maximum 
energy = 47 and the frequency bin index of the 

70 largest sidebin = 48 
or 

the frequency bin index of the largest sidebin = 
46 and energy of maximum bin > C71 "(energy of 
sidebin 46) 

75 b) the frequency bin index of the maximum 
energy = 48 and the frequency bin index of the 
largest sidebin = 47 and energy of maximum 
bin < C72*(energy of sidebin 47) 
There is a match for DTMF fundamental 8 if: 

20 a) the frequency bin index of the maximum 
energy = 51 and the frequency bin index of the 
largest sidebin = 52 and energy of maximum 
bin < C81 "(energy of sidebin 52) 

b) the frequency bin index of the maximum 
25 energy - 52 

c) the frequency bin index of the maximum 
energy = 53 and the frequency bin index of the 
largest sidebin * 52 

or 

30 the frequency bin index of the largest sidebin = 
54 and energy of maximum bin > Cs2*(energy of 
sidebin 54). Cn, C12; C2i, C22; 031, C32*. c*i, C42; 

Csi, C52; C61, 

C62; C71, C72; and csi, C82; are predetermined 

35 constants which are stored in memory 60. In other 
embodiments of the present invention, these con- 
stants may be downloaded into memory 60 from, 
for example, host computer 30. Then, the program 
transfers control to decision box 370. 

40 Decision box 370 is the point in the program at 

which microprocessor 50 tests to see if digflg = 0. 
If digflg = 0, the program is searching for the first 
frame which is suspected of containing an MF digit 
or a DTMF digit, respectively, and, as a result, the 

45 program transfers control to the portion of the 
program which corresponds to decision box 380. If 
digflg = 1 , an MF digit or a DTMF digit, respec- 
tively, is suspected or is verified as having been 
detected and, as a result, control is transferred to 

50 the portion of the program which corresponds to 
box 460. 

Decision box 380 is the point in the program at 
which microprocessor 50 decodes the maxima to 
determine whether the pair of the two largest maxi- 
55 ma corresponds to a valid MF digit or DTMF digit, 
respectively. For example, for MF digits: if the first 
maximum corresponds to the first MF fundamental: 
(a) the MF digit is "1 tt if the second maximum 

10 
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corresponds to the second MF fundamental; (b) the 
MF digit is "2" if the second maximum corre- 
sponds to the third MF fundamental; and so forth. 
Further, for DTMF digits, if the first maximum cor- 
responds to the first lowband DTMF fundamental: 
(a) the DTMF digit is "1" if the second maximum 
corresponds to the first highband DTMF fundamen- 
tal; (b) the DTMF digit is "2 W if the second maxi- 
mum corresponds to the second highband DTMF 
fundamental; and so forth. If a valid MF digit or 
DTMF digit is found, respectively, the program 
transfers control to decision box 390, however, if 
no valid MF digit or DTMF digit is found, respec- 
tively, the program transfers control to box 300. 

Decision box 390 is the point in the program at 
which microprocessor 50 tests to determine wheth- 
er ETOT for this frame, is greater than or equal to 
ETOT for the previous frame. If ETOT for this 
frame is less than ETOT for the previous frame, 
then the energy is decreasing and an MF digit or a 
DTMF digit, respectively, is not recognized as be- 
ing valid. Thus, if ETOT for this frame is less than 
ETOT for the previous frame, the program transfers 
control to box 300, otherwise, control is transferred 
to box 400. 

Box 400 is the point in the program at which 
microprocessor 50 determines the energy of the 
two MF fundamentals or DTMF fundamentals, re- 
spectively, as the sum of the maximum energy and 
the energy of the largest sidebin. The program 
then transfers control to decision box 410. 

Decision box 410 is the point in the program at 
which microprocessor 50 compares the amount of 
MF energy and non-MF energy or DTMF energy 
and non-DTMF energy, respectively, In the present 
frame. The non-MF energy is determined to be 
equal to the difference between ETOT and the MF 
digit energy -the MF digit energy equals the sum 
of the MF fundamentals- and the non-DTMF en- 
ergy is determined to be equal to the difference 
between ETOT and the DTMF digit energy -the 
DTMF digit energy equals the sum of the DTMF 
fundamentals. Then the ratio of ETOT to non-MF 
energy or the ratio of ETOT to non-DTMF energy 
is determined and if the ratio greater than 
BROADMF or BROADDTMF, respectively, where 
BROADMF and BROADDTMF are predetermined 
amounts, the program transfers control to decision 
box 420. If, however, the ratio of ETOT to non-MF 
energy or the ratio of ETOT to non-DTMF energy 
is too small, respectively, we do not have a valid 
MF digit or DTMF digit, respectively, and the pro- 
gram transfers control to box 300. 

Decision box 420 is the point in the program at 
which microprocessor 50 tests the signal-noise- 
ratio in this frame. The ratio of the MF digit energy 
and the energy of the third maximum or the ratio of 
the DTMF digit energy and the energy of the third 



maximum is compared with SNRMF or SNRDTMF, 
respectively, where SNRMF and SNRDTMF are 
predetermined constants. If the ratio of the MF digit 
energy to the energy of the third maximum or the 

5 ratio of DTMF digit energy to the energy of the 
third maximum, i.e., the signal-to-noise ratio, is 
greater than or equal to SNRMF or SNRDTMF, 
respectively, then we have passed the test and the 
program transfers control to decision box 430. If, 

io however, the respective ratio falls below SNRMF or 
SNRDTMF, respectively, we do not have a valid 
MF digit or DTMF digit, respectively, and the pro- 
gram transfers control to box 300. 

Decision box 430 is the point in the program at 

75 which microprocessor 50 examines twist, i.e., the 
ratio of the fundamental energy of the larger en- 
ergy MF fundamental of the MF digit to the energy 
of the smaller energy MF fundamental or the ratio 
of the fundamental energy of the larger energy 

20 DTMF fundamental of the DTMF digit to the energy 
of the smaller energy DTMF fundamental. If the 
respective ratio is smaller than TWISTMF or 
TWISTDTMF, respectively, where TWISTMF and 
TWISTDTMF are predetermined constants, then 

25 the program transfers control to decision box 440. 
If the respective ratio is greater than or equal to 
TWISTMF or TWISTDTMF, respectively, we do not 
have a valid MF digit or DTMF digit, respectively, 
and the program transfers control to box 300. 

30 Decision box 440 is the point in the program at 

which microprocessor 50 determines talkoff. In or- 
der to do so, the following quantities are deter- 
mined. ED and ES are the difference and sum, 
respectively, of ETOT for the present frame and for 

35 the second previous frame, respectively. For MF 
digit detection, if the ratio of ES to ED is larger 
than EDEVMF, where EDEVMF is a predetermined 
constant, and endcnt is greater than or equal to 3, 
then a decision is made that a human voice simu- 

40 lated an MF digit and the program transfers control 
to box 300, otherwise the program transfers control 
to box 450. For DTMF digit detection, if the ratio of 
ES to ED is larger than EDEVDTMF, where 
EDEVDTMF is a predetermined constant, then a 

46 decision is made that a human voice simulated a 
DTMF digit and the program transfers control to 
box 300, otherwise the program transfers control to 
box 450. 

Box 450 is the point in the program at which 
so microprocessor 50 has determined that a suspect- 
ed MF digit or DTMF digit, respectively, is present. 
Flags digflg and nframe are set - 1 , the identity of 
the suspected MF digit or DTMF digit, respectively, 
is saved, and the program transfers control to box 
55 300. 

Box 460 is the point in the program at which 
microprocessor 50 decodes the maxima, in the 
same manner as described for box 380, to deter- 
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mine whether the present frame has a valid MF 
digit or DTMF digit, respectively. The program then 
transfers control to decision box 470. 

Decision box 470 is the point in the program at 
which microprocessor 50 determines whether 
nframe = 1. If nframe is greater than 1, this 
indicates that the MF digit or DTMF digit, respec- 
tively, has already been verified and a search is 
being made for the end of the MF digit or DTMF 
digit, respectively. Thus, if nframe > 1 ( the program 
transfers control to decision box 480. If, however, 
nframe - 1, this indicates that we merely suspect 
the presence of an MF digit or a DTMF digit, 
respectively, and the program transfers control to 
decision box 510. 

Decision box 480 is the point in the program at 
which microprocessor 50 determines whether there 
has been a large dropoff in total energy in this 
frame as compared to the previous frame. Thus, if 
2*ETOT for the present frame is less than ETOT for 
the previous frame, then the program transfers 
control to box 500, otherwise, the program trans- 
fers control to box 490. 

Box 490 is the point in the program at which 
microprocessor 50 has determined the continuing 
presence of the MF digit or DTMF digit, respec- 
tively. This is verified by incrementing nframe by 1 
and then the program transfers control to box 300. 

Box 500 is the point in the program at which 
microprocessor 50 has determined that the energy 
has fallen so low that we have decided that the MF 
digit or DTMF digit, respectively, ended. The flags 
digflg and nframe are reset to 0 [for MF digit 
detection, endcnt is also reset to 0], DM 10 sends 
a signal which indicates that the MF digit or DTMF 
digit, respectively, has ended to host computer 30 
and the program transfers control to box 300. 

Decision box 510 is the point in the program at 
which microprocessor 50 matches the present MF 
digit or DTMF digit, respectively, with the MF digit 
or DTMF digit, respectively, obtained from the pre- 
vious frame. If the two MF digits or DTMF digits, 
respectively, are the same, the program transfers 
control to box 515, otherwise, the program trans- 
fers control to decision box 530. 

Box 515 is the point in the program at which 
DM 10 sends a signal to host computer 30 which 
identifies the start of an MF digit or a DTMF digit, 
respectively, and which identifies the MF digit or 
the DTMF digit, respectively. The program then 
transfers control to decision box 520. 

Decision box 520 is the point in the program at 
which microprocessor 50 determines whether there 
has been a large dropoff in total energy in the 
present frame as compared to the previous frame. 
Thus, if 2*ETOT for the present frame is less than 
ETOT for the previous frame, the program transfers 
control to box 500, otherwise, the program trans- 



fers control to box 490. 

Decision box 530 is the point in the program at 
which microprocessor 50 determines whether the 
present frame has- a valid MF digit or DTMF digit, 

5 respectively, after the previous frame had a pre- 
sumably valid MF digit or DTMF digit respectively. 
If no valid digit has been found, then the program 
transfers control to decision box 540, otherwise the 
program transfers control to decision box 560. 

w Decision box 540 is the point in the program at 

which microprocessor 50 has determined that there 
is no valid MF digit or DTMF digit, respectively, 
present in this frame even though a presumably 
valid MF digit or DTMF digit, respectively, was 

15 present in the previous frame. If the drop in energy 
from the previous frame is not large, then a de- 
cision is made that there was no valid MF digit or 
DTMF digit, respectively, in the previous frames 
because a valid MF digit or DTMF digit, respec- 

20 tively, is expected to end with a large drop in 
energy. For MF digits, if 1 .33*ETOT for the present 
frame is larger than ETOT for the previous frame, 
the suspected digit is dropped by resetting flags 
digflg, endcnt, and nframe to 0 and the program 

25 recycles to look for a MF digit —all of which is done 
when the program transfers control to box 545. For 
DTMF digits, if 2*ETOT for the present frame is 
larger than ETOT for the previous frame, the sus- 
pected digit is dropped by resetting flags digflg 

30 and nframe to 0 and the program recycles to look 
for a DTMF digit -all of which is done when the 
program transfers control to box 545. If, however, 
there is a large falloff in energy from the previous 
frame to the present frame, the program transfers 

35 control to box 550. 

Box 550 is the point in the program where 
there is a large falloff in energy from the previous 
frame to the present frame. Flags digflg and 
nframe are set to zero to indicate the end of a digit 

40 [for MF digits, endcnt is also set to zero], a signal 
which identifies the start and the identity of the MF 
digit or DTMF digit, respectively, is sent to host 
computer 30, and a signal which indicates the end 
of the MF digit or DTMF digit, respectively, is sent 

45 to host computer 30. Finally, the program transfers 
control to box 300. 

Decision box 560 is the point in the program at 
which microprocessor 50 has determined the pres- 
ence of a different MF digit or DTMF digit, respec- 

50 tively, than the MF digit or DTMF digit respec- 
tively, which was detected in the previous frame. If 
the total energy of the present frame is less than 
the total energy of the previous frame, signifying a 
drop in energy, we end the previous digit and 

55 ignore the present digit. This is done by transfer- 
ring control to box 550. If, however, the total en- 
ergy of the present frame is greater than the total 
energy of the previous frame, we ignore the results 

12 
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of the analysis of the previous frame and the pro- 
gram transfers control to box 400. 

As should be clear to those of ordinary skill in 
the art, the embodiment of the present invention 
which was described in detail above is an MF and 
DTMF detector which analyzes an input signal and, 
in response thereto, generates an identifier signal 
for use by another apparatus such as host com- 
puter 30- For example, the another apparatus can 
be an interactive system which can transmit voice 
signals to a human in the form of information and 
queries and can receive responses from the human 
in the form of various combinations of DTMF digits 
which are conveniently available from a Touch 
Tone telephone. In such a system, embodiments of 
the present invention advantageously provide de- 
tection and identification of the DTMF digits which 
are transmitted to the iterative system. 

As should be clear to those of ordinary skill in 
the art, further embodiments of the present inven- 
tion may be made without departing from its teach- 
ings. For example, in accordance with the present 
invention, the energy in the frequency bins in the 
frequency spectrum of a frame of the signal, e(n), 
may be determined in many different ways. In 
particular, in another embodiment of the present 
invention, e(n) equals the sum of the absolute value 
of the real part of the component of frequency bin 
n and the absolute value of the imaginary part of 
the component of frequency bin n. 

As should be clear to those of ordinary skill in 
the art, en; C12; C21; 022; Cai; C32; C41; c«; csi; 
C52; Cei; C62; C71; C72; Csi; Cs2l ERGDTMF; 
BROAD DTMF; SNRDTMF; TWISTDTMF; 
EDEVDTMF; dn; di2; d2i; d22l d3i; d32; d*i; 6a2\ 
dsi; dsa; d G i; d 6 2l ERGMF; BROADMF; SNRMF; 
TWISTMF; and EDEVMF are predetermined con- 
stants which are stored in memory 60. In other 
embodiments of the present invention, these con- 
stants may be downloaded into memory 60 from, 
for example, host computer 30 to make the detec- 
tor easily customized for a particular application. 
For example, in a preferred embodiment of the 
present invention, Cn = 8; C12 = 1; C21 = 2; C22 
= 16; C31 = 8; C32 = 2; C41 = 2; c« = 1; C51 = 
4; C52 = 4; c G i, = 4; c 6 2 = 2; C71 = 2; C72 = 4; 
csi = 1; C82 = 8; ERGDTMF = 250; 
BROAD DTMF = 4; SNRDTMF = 89; TWISTDTMF 
« 10; EDEVDTMF = 3; dn = 1; di 2 = 6; d 2 i = 
4; d 2 2 = 1; <*3i = 2; **2 = 6; d+i =1; d*2 = 2; 
dsi * 6; d 5 2 = 6; dsi = 2; d 62 = 1; ERGMF = 
40; BROADMF = 2; SNRMF = 12; TWISTMF = 
10; and EDEVMF « 3. 

In addition, we have been able to reduce the 
time it takes to obtain the FFT for two channels by 
utilizing channel 1 as the real array input and 
channel 2 as the imaginary array input to the 256 
point, complex FFT performed on this data by DSP 



65. As a result, in such an embodiment, we are 
able to determine the fourier spectrum of 2 chan- 
nels simultaneously. The respective fourier spec- 
trum coefficients for each channel are determined 
5 from the output of DSP 65 in accordance with the 
following: 

Rch.1(n) = [R(n) + R(256-n)] 

Ich.1(n) = [l(n) - l(256-n)] 

Rch.2(n) - [l(n) + l(256-n)] 
10 lch.2(n) m - [R(n) - R(256-n)] 
where: 

R(n) is the real fourier spectrum result 

coefficient for the nth frequency bin 
I(n) is the imaginary fourier spectrum 

75 result coefficient for the nth fre- 

quency bin 

Rch.1(n) is real fourier spectrum coefficient 
for the nth frequency bin for chan- 
nel 1 

20 lch.1(n) is the imaginary fourier spectrum 
coefficient for the nth frequency bin 
for channel 1 



25 

Claims 

1. A method for detecting and identifying pre- 
determined digits in an analog or digital signal, 
30 a digit being comprised of two predetermined, 

substantially single frequency tones, which 
method comprises the steps of: 

(a) obtaining digital measures of the am- 
plitude of the signal, if the signal is a digital 

35 signal, or digital measures of a sample of 

the amplitude of the signal, if the signal is 
an analog signal; 

(b) determining a measure of energy of 
frequency bins for a predetermined portion 

40 of the frequency spectrum of a frame com- 

prised of a predetermined number of the 
digital measures; 

(c) determining the energy ETOT for the 
predetermined portion by summing the 

46 measure of energy of the frequency bins; 

(d) determining the energy of a predeter- 
mined number of maxima of the measure of 
energy of the frequency bins; 

(e) comparing at least one of ETOT, the 
50 predetermined number of energy maxima, 

and predetermined sums of energy mea- 
sures with predetermined thresholds; 

(f) determining whether a comparison be- 
tween at least one of ETOT, the predeter- 

55 mined number of energy maxima, and pre- 

determined sums of energy measures for 
the present frame and for predetermined 
ones of previous frames satisfies predeter- 
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mined criteria; and 
(g) identifying the digit 

2. The method of claim 1 wherein the predeter- 
mined ones of previous frames of step (f) 
comprises the second previous frame. 

3. The method of claim 1 wherein successive 
frames of step (b) are comprised of a predeter- 
mined number of the same digital measures. 

4- The method of claim 1 wherein step (b) com- 
prises obtaining real and imaginary parts of the 
contents of the frequency bins by fourier trans- 
form. 

5. The method of claim 4 wherein step (b) further 
comprises summing the absolute value of the 
real part and the absolute value of the imagi- 
nary part of the contents of a frequency bin. 

6. The method of claim 4 wherein step (b) further 
comprises summing the square of the real part 
and the square of the imaginary part of the 
contents of a frequency bin. 

7. The method of claim 1 wherein step (a) further 
comprises a step of converting the digital mea- 
sures into a linear PCM digital format, if the 
signal is an analog signal, or converting the 
digital measures into a linear PCM digital for- 
mat, if the signal is a digital signal and is not 
already so encoded; 

8. The method of claim 1 wherein step (e) com- 
prises: 

testl step of determining whether 
ETOT falls below a testl predetermined 
threshold and. if so, returning to step (a); 

test2 step of determining whether any 
of the predetermined number of energy maxi- 
ma are within a predetermined degree of 
closeness to one of the predetermined single 
frequency tones and determining whether the 
energy of the two largest energy maxima cor- 
respond to one of a predetermined number of 
possible digits and, if not, returning to step (a); 

test3 step of determining whether 
ETOT for the present frame is less than ETOT 
for the previous frame and, if so, returning to 
step (a); 

test4 step of determining whether the 
ratio of ETOT to non-digit energy is less than a 
test4 predetermined threshold and, if so, re- 
turning to step (a); 

tests step of determining whether the 
ratio of the digit energy and background en- 
ergy is less than a tests predetermined thresh- 



old and, if so, returning to step (a); and 

test6 step of determining whether the 
ratio of energy of the larger energy single 
frequency tone of the digit to energy of the 
s smaller energy single frequency tone of the 

digit is less than a test6 predetermined thresh- 
old and, if not, returning to step (a). 

9. The method of claim 8 wherein the test4 step 
10 comprises determining the non-digit energy to 

be equal to ETOT minus the sum of the en- 
ergy of the two largest energy maxima 

10. The method of claim 8 wherein the tests step 
T5 comprises determining the background energy 

to be equal to the energy of the third largest 
energy maximum. 

11. The method of claim 1 wherein step (f) com- 
20 prises determining a difference, ED, and a 

sum, ES, respectively, of ETOT for the present 
frame and for the second previous frame, re- 
spectively, and determining whether the ratio 
of ES and ED is greater than an historical 
25 predetermined amount and, if so, returning to 

step (a). 

12. The method of claim 1 which further comprises 
step (h) of detecting the end of the digit. 

30 

13. The method of claim 12 wherein step (h) com- 
prises determining when ETOT falls below an 
ending predetermined threshold with respect to 
predetermined preceding frames. 

35 

14. The method of claim 8 wherein step (g) com- 
prises identifying the digit after a first frame 
has passed through step (f) and the next frame 
has passed through predetermined steps of 

40 testl step to test6 step. 

15. The method of claim 14 wherein the predeter- 
mined steps comprise testl step and test2 
step. 

45 

16. An apparatus for detecting and identifying pre- 
determined digits in an analog or digital signal, 
a digit being comprised of two predetermined, 
substantially single frequency tones, which ap- 

so paratus comprises: 

obtaining means for obtaining digital 
measures of the amplitude of the signal, if the 
signal is a digital signal, or digital measures of 
a sample of the amplitude of the signal, if the 
55 signal is an analog signal and for sending a 

signal to energy means; 

the energy means, responsive to the 
signal from the obtaining means, for determin- 

14 
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ing a measure of energy of frequency bins for 
a predetermined portion of the frequency 
spectrum of a frame comprised of a predeter- 
mined number of the digital measures; for de- 
termining the energy ETOT for the predeter- 
mined portion by summing the measure of 
energy of the frequency bins; for determining 
the energy of a predetermined number of 
maxima of the measure of energy of the fre- 
quency bins, and for sending a signal to first 
comparing means; 

the first comparing means, responsive 
to the signal from the energy means, for com- 
paring at least one of ETOT, the predeter- 
mined number of energy maxima, and pre- 
determined sums of energy measures with 
predetermined thresholds and, if the tests are 
passed, for sending a signal to a second com- 
paring means, otherwise, sending a signal to 
the obtaining means; 

the second comparing means, respon- 
sive to the signal from the first comparing 
means, for determining whether a comparison 
between at least one of ETOT, the predeter- 
mined number of energy maxima, and pre- 
determined sums of energy measures for the 
present frame and for predetermined ones of 
previous frames satisfies predetermined cri- 
teria and, if so, setting an indicator and send- 
ing a signal to the obtaining means, otherwise, 
sending a signal to the obtaining means. 

17. The apparatus of claim 16 wherein the first 
comparing means comprises: 

testl means, responsive to the signal 
from the energy means, for determining wheth- 
er ETOT falls below a testl predetermined 
threshold and, if so, sending a signal to the 
obtaining means, otherwise, sending a signal 
to test2 means; 

the test2 means, responsive to the sig- 
nal from the testl means, for determining 
whether any of the predetermined number of 
energy maxima are within a predetermined de- 
gree of closeness to one of the possible fun- 
damental frequencies and for determining 
whether the energy of the two largest energy 
maxima correspond to one of a predetermined 
number of possible digits and, if not sending a 
signal to the obtaining means, otherwise, send- 
ing a signal to test3 means; 

test3 means, responsive to the signal 
from the test2 means, for determining whether 
ETOT for the present frame is less than ETOT 
for the previous frame and, if so, sending a 
signal to the obtaining means, otherwise, send- 
ing a signal to test4 means; • 

the test4 means, responsive to the sig- 



nal from the test3 means, for determining 
whether the ratio of ETOT to non-digit energy 
is less than a test4 predetermined threshold 
and, if so, sending a signal to the obtaining 
5 means, otherwise, sending a signal to tests 

means; 

tests means, responsive to the signal 
from the test4 means, for determining whether 
the ratio of digit energy and background en- 
io ergy is less than a tests predetermined thresh- 

old and, if so, sending a signal to the obtaining 
means, otherwise, sending a signal to test6 
means; 

the test6 means, responsive to the sig- 
75 nal from the tests means, for determining 

whether the ratio of energy of the larger en- 
ergy single frequency tone of the digit to en- 
ergy of the smaller energy single frequency 
tone of the digit is less than a test6 predeter- 
20 mined threshold and, if not, sending a signal to 

the obtaining means, otherwise, sending a sig- 
nal to the second comparison means. 

18- The apparatus of claim 17 wherein: 
25 the test2 means further comprises 

means for generating a signal identifying the 
digit instead of sending a signal to test3 means 
whenever the indicator for the previous frame 
is set. 
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